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ABSTRACT 

Special AT-rich sequence-binding protein 1 (SATB1) 
is a global chromatin organizer and gene expression 
regulator essential for T-cell development and 
breast cancer tumor growth and metastasis. The 
oligomerization of the N-terminal domain of SATB1 
is critical for its biological function. We determined 
the crystal structure of the N-terminal domain of 
SATB1. Surprisingly, this domain resembles a ubi- 
quitin domain instead of the previously proposed 
PDZ domain. Our results also reveal that SATB1 
can form a tetramer through its N-terminal domain. 
The tetramerization of SATB1 plays an essential role 
in its binding to highly specialized DNA sequences. 
Furthermore, isothermal titration calorimetry results 
indicate that the SATB1 tetramer can bind simultan- 
eously to two DNA targets. Based on these results, 
we propose a molecular model whereby SATB1 
regulates the expression of multiple genes both 
locally and at a distance. 



INTRODUCTION 

Global gene regulation is essential for cell differentiation 
and maturation. Local and long-range coordinated gene 
regulations provide flexibility for the coregulation and 
modulation of related genes to meet various biological 
requirements during development or tumors growth and 
metastasis (1). The chromatin architecture in the nucleus 
plays an important role in regulating gene expression (2). 
The sequences of the nuclear matrix attachment regions 
(MARs), also referred to as base-unpairing regions 
(BURs), are generally 70% AT rich (3). The unwinding 
property of BURs has been shown to be essential for 
binding to the nuclear matrix and enhancing promoter 



activity (4). Special AT-rich sequence-binding protein 1 
(SATB1) binds to the core unwinding elements of BURs 
and recruits various chromatin remodeling/modifying 
enzymes to regulate gene expression by directly influen- 
cing various gene promoter activities (5-8). Strikingly, 
SATB1 is also able to regulate gene expression at long 
distances, up to several hundred kb (9-11). A recent 
report showed that SATB1 is aberrantly expressed in 
human metastatic breast cancer and coordinately regu- 
lates the expression of sets of genes that promote breast 
cancer tumor growth and metastasis (12). 

SATB1 was initially identified as a cell type-specific 
MAR DNA-binding protein, predominantly expressed in 
the thymus (3). SATB1 consists of an N-terminal domain, 
a C-terminal homeodomain (HD) and tandem CUT 
domains in the center (Figure 1A). The sequence-specific 
binding of SATB1 to its DNA targets is mediated by the 
HD and CUT tandem domains (13,14) and requires oligo- 
merization of the N-terminal domain (15,16). In addition, 
chemical interference assays suggested a rapid association 
and dissociation kinetics of DNA binding by SATB1, and 
the dissociation rate (^ G ff) for the multiple AT-rich- 
containing DNA fragments (such as seven repeats) is 
slower than the less AT-rich-containing DNA fragments 
(such as two repeats) (3). In addition to its DNA-binding 
ability, SATB1 also acts as a 'docking site' for various 
chromatin remodeling/modifying enzymes and transcrip- 
tion factors (5,7,9,11,17). The transcriptional activity of 
SATB1 is regulated by several post-translational modifi- 
cations such as phosphorylation (18), acetylation (7,18) 
and sumoylation (19). Taken together, these studies dem- 
onstrate that SATB1 acts as a linker between DNA loop 
organization, chromatin modification/remodeling and the 
association of transcription factors with MARs, and thus 
it functions as a 'genome organizer' that is essential for 
T-cell development (17,20). 

A structural study of the SATB1 CUT1 domain bound 
to DNA showed that the CUT1 domain binds to the 
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Figure 1. Structure of the SATB1 ULD. (A) Schematic representation of the domain organization of mouse SATB1. The ULD domain boundary 
identified in this work is located from Gly71 to Serl72, and a novel CUTL domain is located from His 186 to Lys244. The two mutants used in this 
study, KWN-AAA and EFH-AAA, were created by substituting the 136 K 137 W 138 N and the 97 E 98 F 162 H motifs with 'AAA' cassettes. (B) Cartoon 
representation of the overall structure of ULD. The N- and C-termini of the protein are labeled. (C) Stereo view showing the superimposed structures 
of ULD and ubiquitin (PDB code: 1UBI). 



major groove of DNA (21). However, the molecular 
mechanism of how SATB1 oligomerization regulates 
DNA binding has remained elusive. In this study, we 
present the crystal structure of the N-terminus of 
SATB1. Surprisingly, it resembles a ubiquitin domain 
instead of the previously identified PDZ domain. We 
also found that SATB1 can assemble into a tetramer, 
which provides insight into molecular basis of its ability 
to regulate gene expression at long distances. 

MATERIALS AND METHODS 

Expression and purification of various N-terminal 
fragments of SATB1 

Various fragments (Supplementary Data and 
Supplementary Figure SI) of the mouse SATB1 gene 



were PCR amplified from a mouse thymus cDNA 
library and cloned into an in-house modified version of 
the pET32a (Novagen) vector and confirmed by DNA 
sequencing. The resulting protein contained a His 6 -tag in 
its N-terminus. All point mutations of SATB1 described 
here were created using the standard PCR-based mutagen- 
esis method and confirmed by DNA sequencing. The 
recombined protein was expressed in BL21 (DE3) 
Escherichia coli cells at 16°C for 16-18 h. The His 6 - 
tagged protein was purified by Ni-NTA (QIAGEN) 
affinity chromatography followed by size exclusion chro- 
matography on a HiLoad 26/60 Superdex 200 (GE 
Healthcare). After digestion with PreScission Protease to 
cleave the N-terminal His 6 -tag, the target protein was 
purified on a Mono Q 10/100 GL (GE Healthcare) 
anion-exchange column. The final purification step was 
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size exclusion chromatography on a HiLoad 26/60 
Superdex 200 column in 50 mM Tris pH 8.0, 50 mM 
NaCl, 1 mM EDTA and 1 mM DTT. 

Se-Met-recombined protein was expressed in B834 
(DE3) E. coli cells at 20°C for 20 h. The B834 (DE3) 
cells were cultured in LeMaster medium in which methio- 
nine was replaced by selenomethionine. The Se-Met 
substituted protein was purified as wild-type protein as 
described above. 

Expression and purification of full-length protein 

The outline for the expression and purification of 
full-length protein is summarized in Supplementary 
Figure S7. Briefly, SATB1 and its mutants were expressed 
in E. coli as a fusion protein with an N-terminal Trx-His 6 - 
tag and C-terminal His 6 -tag. The Trx-His 6 -tagged protein 
was purified by Ni-NTA affinity chromatography 
followed by size exclusion chromatography on a HiLoad 
26/60 Superdex 200 column. After digestion with 
PreScission Protease to cleave the N-terminal Trx-His 6 - 
tag, the target protein was purified on a Mono Q 10/100 
GL anion-exchange column. The protein was then 
denatured with 6 M guanidine hydrochloride and 
purified by Ni-NTA affinity chromatography under 
denaturing conditions by its C-terminal His 6 -tag. The 
eluted protein was refolded by extensively dialyzing out 
the denaturant in 50 mM Tris pH 8.0, 100 mM NaCl, 
ImM EDTA and 1 mM DTT at 4°C for 12 h at least 
three times. The refolded protein was further purified on 
a Mono Q 10/100 GL anion-exchange column followed by 
size exclusion chromatography on a HiLoad 26/60 
Superdex 200 column in 50 mM Tris pH 8.0, 100 mM 
NaCl, 1 mM EDTA and 1 mM DTT. 

Crystallization and data collection 

The wild-type protein was crystallized using the sitting 
drop vapor diffusion method equilibrated against a reser- 
voir solution of 15% polyethylene glycol 3350, 0.15 M 
ammonium citrate dibasic and 4% polypropylene glycol 
P400. Selenium-substituted protein was crystallized under 
similar conditions to the native protein, except that 4% 
polypropylene glycol P400 was replaced by 3% dextran 
sulfate. Crystals grew at 20° C and were frozen in a cryo- 
protectant solution consisting of the reservoir solution 
supplemented with 15% glycerol. All crystals belonged 
to the space group of P2 1 2 1 2 1 . The wild-type crystals dif- 
fracted to 1.7 A with unit cell dimensions of a = 35.90 A, 
6 = 71.03 A and c = 153.76 A and Se-Met substituted 
crystals diffracted to 2.1 A with unit cell dimensions of 
<2 = 36.14A, 6 = 71.23 A and c = 154.23 A. A native 
data set and a single anomalous dispersion (SAD) data 
set were collected at the peak wavelength for Se on 
station BL17U1 of the Shanghai Synchrotron Radiation 
Facility (SSRF). Both data sets were processed using 
HKL2000 software (22). 

Structure determination and refinement 

The program HKL2MAP (23) was used to search for 16 
Se sites and the initial SAD phases were then calculated 
using PHENIX program (24). A model covering 60% of 



the protein molecule was automatically built into the SAD 
map using the PHENIX program (24), and additional 
residues were manually built into the electron density 
with the Coot program (25). Then, molecular replacement 
by the Phaser program (26) was performed to locate the 
exact position of four protein molecules in one asymmetric 
unit of the wild-type data. The final tetramer model was 
refined iteratively by the CNS (27) and the Coot programs 
(25). The orientations of the amino acid side chains and 
bound water molecules were modeled based on sigmaA 
weighted 2F obs -F calc and F obs -^ calc Fourier electron 
density maps. The final structure had an ^ crystal value of 
15.8% and an 7? free value of 21.8%. The statistics on the 
structure refinement are summarized in Table 1. 

Analytical ultracentrifugation 

Sedimentation velocity (SV) and sedimentation equilib- 
rium (SE) experiments were performed in a Beckman/ 
Coulter XL-I analytical ultracentrifuge using double- 
sector or six-channel centerpieces and sapphirine 
windows. An additional protein purification step on a 
HiLoad 26/60 Superdex 200 size exclusion column in 
50 mM Tris pH 8.0, lOOmM NaCl, ImM EDTA and 
1 mM TCEP was performed before the experiments. SV 
experiments were conducted at 50 000rpm and 20° C using 
absorbance detection and double-sector cells loaded with 
~80uM for ubiquitin-like domain (ULD). For the 
full-length protein, SV experiments were conducted at 
40 000rpm and 4°C with 7.5 and 15uM protein. For the 
SE experiment, data were collected at 20° C and 
18 000rpm with -16, 24 and 40 uM ULD, at 4°C and 
4300, 6400 and 8000 rpm with 2.9, 4.4 and 7.3 uM 
wild-type full-length SATB1 and at 4°C and 6000, 8000, 
10 000 and 12 000 rpm with 3.3, 4.9 and 8.2 uM KWN- 
AAA mutant, respectively. The buffer composition 
(density and viscosity) and protein partial specific 
volume (V-bar) were obtained using the program 
SEDNTERP (http://www.rasmb.bbri.org/). The SV and 
SE data were analyzed using the programs SEDFIT and 
SEDPHAT (28,29). 

Electrophoretic mobility shift assay 

The forward and reverse oligonucleotides for a particular 
set were mixed together. To anneal the labeled oligo- 
nucleotides, the mixtures were heated to 95°C for lOmin 
and allowed to cool slowly at room temperature. 
Annealed oligonucleotides were end labeled with y- 32 P 
ATP using T4 polynucleotide kinase (New England 
Biolabs). Binding reactions were performed in a lOul 
total volume containing lOmM Tris pH 7.5, ImM 
DTT, 50 mM KC1, 5mM MgCl 2 , 2.5% glycerol, 0.05% 
NP-40 and the appropriate amount of annealed oligo- 
nucleotides and recombinant proteins. Samples were 
incubated on ice for 1 h and separated by electrophoresis 
on a 5% native polyacrylamide gel (PAGE). The gels were 
dried under vacuum and exposed to X-ray film. The radio- 
active intensities of protein-bound DNA bands and free 
DNA probe reduction due to protein binding were 
calculated with Adobe Photoshop CS4 and normalized 
to that of the free DNA probe. 
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Table 1. Data collection and refinement statistics for SATB1 structures 
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Analytical gel filtration 

Size exclusion chromatography was performed on an 
AKTA FPLC system using a Superose 12 10/300 
column (GE Healthcare) for ULD or Superdex 200 10/ 
300 column (GE Healthcare) for full-length SATB1. 
Protein samples were dissolved in buffer containing 
50 mM Tris pH 8.0, 100 mM NaCl, 1 mM EDTA and 
1 mM DTT. The column was calibrated with a gel filtra- 
tion standard from Bio-Rad. 



Circular dichroism 

Circular dichroism (CD) spectra of various proteins were 
collected on a MOS450 spectropolarimeter (BioLogic) at 
room temperature. The protein samples (~6|iM) were 
dissolved in 50 mM Tris pH 8.0, 100 mM NaCl, 1 mM 
EDTA and 1 mM DTT. 

Isothermal titration calorimetry 

Isothermal titration calorimetry (ITC) measurements were 
performed on a MicroCal M Isothermal Titration 
Calorimeter iTC200 (GE Healthcare) in 20 mM PBS pH 
7.2 and 50 mM NaCl. For wild-type SATB1, -232 uM of 
DNA probe was titrated into 11.7 |iM of protein. For the 



KWN-AAA mutant, 239 uM of DNA probe was titrated 
into 16|iM of protein. The titration consisted of an initial 
injection of 0.4 |il followed by 26 injections of 1.5 |il every 
120 s at 15°C. To determine the baseline, the DNA probe 
was titrated into the same buffer without protein 
under the same conditions. The titration data and 
binding plot after baseline subtraction were analyzed 
with the MicroCal Origin software with the two-site and 
one-site-binding models for wild-type SATB1 and the 
KWN-AAA mutant, respectively. 

Chemical cross-linking assay 

Chemical cross-linking assay was carried out by incu- 
bating (1-263)AA or SATB1 proteins with Lys-specific 
cross-linker disuccinimidyl glutarate (DSG, Thermo 
Pierce). The reaction was carried out in 20 mM Hepes 
buffer (pH 7.5) containing various concentrations of 
NaCl and glycerol with a protein concentration of 
0.5mg/ml for 15min on ice. The concentration of DSG 
was adjusted to be 5 M equivalent of total Lys concentra- 
tion of each protein. The cross-linking reactions were 
quenched by addition of 50 mM Tris to the reaction 
solution. GST and MBP were served as positive and 
negative control, respectively. 
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RESULTS 

Overall structure of the ULD domain of SATB1 

To identify a stable region of the N- terminus of SATB1 
for crystallization, several constructs were designed and 
tested for protein expression and purification. Among 
them, seven purified proteins formed tetramers, as 
determined by analytical ultracentrifugation SV experi- 
ments (Supplementary Figure SI). After extensive crystal 
screening, high-quality crystals were obtained only with 
the construct containing residues 71-172. The crystal 
structure of residues 71— 172 o of SATB1 was solved by 
SAD at a resolution of 1.8 A (Table 1). Strikingly, the 
overall structure of this region contains four antiparallel 
(3-sheets ((31-(34) flanked by four oc-helices (ocl-oc4) 
(Figure IB), and does not show any similarity to the pre- 
viously defined PDZ domain (15). Instead, it resembles a 
typical ubiquitin domain. The superposition of the 
N-terminal domain of SATB1 onto the classic ubiquitin 
domain results in a root-mean-square deviation of 2.7 A 
for the 67 Coc-atoms (Figure 1C), even though the two 
proteins share no sequence homology. Therefore, we 
renamed this domain a ULD (Figure 1A). 

Prior studies have proposed that residues 90-204 of 
SATB1 form a PDZ-like domain based on primary 



sequence alignment (15). So far, two structures of 
residues 179-250 of SATB1 have been deposited in the 
Protein Data Bank with accession codes 3NZL and 
2L1P. Interestingly, the structure of residues 179-250 re- 
sembles the CUT1 domain of SATB1, and this region was 
designated a CUT 1 -like (CUTL) domain (Supplementary 
Figure S2 and Figure 1A). Therefore, according to current 
structural studies, it appears that the N-terminus of 
SATB1 does not contain a PDZ domain. A sequence 
alignment of SATB family proteins (SATB1 and 
SATB2) across species shows that the residues of the 
ULD domain are all highly conserved (Supplementary 
Figure S3), indicating that the ULD domain we identified 
in this study may exist in all SATB family members and 
have similar biological functions. 

ULD-mediated SATB1 tetramerization 

Our crystal structure contains four ULD molecules, which 
seem to form a tightly packed tetramer, in one asymmetric 
unit (Figure 2A). Consistent with this observation, 
ULD was eluted as a single peak from a size exclusion 
column with a molecular mass corresponding to a 
tetramer (Supplementary Figure S4A). Analytical ultra- 
centrifugation further confirmed that ULD assembles 
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Figure 2. Interface of the SATB1 ULD homotetramer. (A) Ribbon diagram of a representative SATB1 ULD homotetramer. The four individual 
ULDs are colored red, cyan, green and yellow. Schematic diagrams showing the detailed interactions within the cyan-yellow dimer (B) and the 
green-yellow dimer (C). The color of the residues within each dimer interface is same as that of the individual ULDs in (A). Hydrogen bonds and 
hydrophobic interactions are shown as red and gray dotted lines, respectively. 
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into a tetramer with a molecular mass of ~43.5kDa 
(Supplementary Figure S4B). These results demonstrate 
that ULD forms a homotetramer in solution. 

In the structure of the ULD tetramer, two dimers 
(cyan-yellow and green-yellow) are formed with 
multiple hydrogen bonds and hydrophobic interactions 
within their interfaces (Figure 2B and C, Supplementary 
Figure S5). It has been noted that high concentra- 
tions of NaCl and glycerol disrupt neither the ULD 
tetramer nor ULD-mediated full-length SATB1 tetramer 
(Supplementary Figure S6). The buried surface area 
calculated by o the program AREAIMOL (30) is 1343.6 
and 1647.5 A 2 within the interface of the cyan-yellow 
and green-yellow dimers, respectively, indicating that 
both dimers could play a role in the oligomerization of 
full-length SATB1. The interactions within the interface of 
both dimers were analyzed in detail. These interactions 
display non-crystallographic 2-fold symmetry. Within 
the interface of the cyan-yellow dimer (Figure 2B and 
Supplementary Figure S5A), Glu97 from the cyan 
monomer makes several hydrogen bonds with Thr72 
from the yellow monomer. Additionally, Phe98 and 
His 162 from the cyan monomer form stacking interactions 
with Pro75 and His 162 from the yellow monomer, respect- 
ively. Within the green-yellow interface (Figure 2C and 
Supplementary Figure S5B), three residues (Del 32, 
Vail 34 and Vail 45) from the green monomer form a 
hydrophobic pocket to accommodate the bulky side 
chain of Trpl37 from the yellow monomer. Atom NE1 
of the side chain of Trpl37 from the green monomer 
forms a hydrogen bond with the main chain oxygen 
atom of Met 156 from the yellow monomer. Asnl38 and 
Asp 159 from the green monomer make hydrogen bonds 
with Asp 147 and Lysl36 from the yellow monomer, re- 
spectively. All of the above interactions are also recipro- 
cal. Therefore, the < 97 e 98 F 162 H' motif (residue Glu97, 
Phe98 and His 162) is presumably important for the for- 
mation of the cyan-yellow ULD dimer, and the 
< 136 K 137 W 138 N' motif (residues 136-138) is important for 
the formation of the green-yellow ULD dimer. For the 
convenience of further discussion, the cyan-yellow ULD 
dimer interface (Figure 2B) and the green-yellow ULD 
dimer interface (Figure 2C) are referred to as the EFH 
and KWN interfaces, respectively. 

To investigate the oligomerization of SATB1 in 
solution, we mutated residues within the two interfaces to 
test the ULD domain-mediated oligomerization state of 
full-length SATB1. The first mutant was designed to 
disrupt the EFH interface-mediated ULD dimer by 
replacing the 97 E 98 F 162 H motif with an 'AAA' cassette, 
and the second mutant was designed to disrupt the 
KWN interface-mediated ULD dimer by substituting 
the 136 K 137 W 138 N motif with an 'AAA' cassette (referred 
to as EFH-AAA and KWN-AAA, respectively) 
(Figure 1A). Wild- type SATB1 and the two mutants 
were purified to homogeneity by the protocol summarized 
in Supplementary Figure S7 (Figure 3A, inset) and 
analyzed by SV analysis. Wild- type SATB1 exhibited a 
narrow sedimentation coefficient distribution in continu- 
ous size distribution analysis (black lines in Figure 3A) 
(28), suggesting that it is mono-disperse, and it is 
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Figure 3. Oligomeric assembly of wild-type and SATB1 mutants 
measured by analytical ultracentrifugation. (A) c(S) distributions from 
SV runs for SATB1 (7.5 uM, dotted black line; 15 uM, solid black line), 
EFH-AAA (7.5 uM, dotted cyan line; 15uM, solid cyan line) and 
KWN-AAA (7.5 uM, dotted red line; 15 uM, solid red line). 
Representative SE profiles for SATB1 (B) and KWN-AAA (C) 
derived from a global fit MW ~344.4kDa (la confidence interval 
342.6-345.8) and ~174.5kDa (la confidence interval 174.1-174.9), 
indicating that wild-type SATB1 and the KWN-AAA mutant 
assemble into a tetramer and dimer in solution, respectively. 
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estimated to be a stable tetramer. SE analysis of ana- 
lytical ultracentrifugation further confirmed that wide- 
type full-length SATB1 assembled into a tetramer 
with a molecular mass of ~344.4kDa (Figure 3B and 
Supplementary Figures S8A-S8C). The KWN-AAA 
mutant also exhibited a mono-disperse sedimentation co- 
efficient distribution (red lines in Figure 3A), suggesting 
that it is a dimer. The dimeric state of the KWN-AAA 
mutant in solution was confirmed by SE analysis, yielding 
a calculated molecular weight of M74.5kDa (Figure 3B 
and Supplementary Figures S8D-S8F). In sharp con- 
trast, the EFH-AAA mutant exhibited a wide range of 
sedimentation coefficients in the c(S) distribution (cyan 
lines in Figure 3A) indicating that it exists as multiple 
forms of oligomers including dimmers, tetramers and 
even high-order form. These data suggest that this 
mutation drastically interferes with the oligomerization 
of SATB1 and that the EFH-AAA mutant does not 
have a unique form of oligomers, which is consistent 
with its elution profile by size exclusion chromatography 
(Supplementary Figure S9). Taken together, we conclude 
that both dimers in the ULD tetramer contribute to the 
oligomerization of full-length SATB1. 

SATB1 oligomerization is required for DNA binding 

SATB1 has been shown to bind to AT-rich DNA of BURs 
through its C-terminal tandem CUT domains and HD 
(13,14) and lost its DNA binding activity after deletion 
the N-terminal 248 amino acids of SATB1 (15). We there- 
fore investigated the role of ULD domain-mediated oligo- 
merization in SATBl's DNA-binding activity. To 
accomplish this, we assessed the ability of wild-type 
SATB1 and the KWN-AAA and EFH-AAA mutants 
to bind a 37-bp DNA fragment known to bind wild-type 
SATB1 (Figure 4A) (6) via an electrophoretic mobility 
shift assay (EMSA). The DNA-binding affinity of 
wild-type SATB1 to the 37-bp DNA was stronger than 
that of the KWN-AAA and EFH-AAA mutants in 
dose-dependent EMSAs varying either the protein or the 
DNA probe concentration (Figure 4B and C). Thus, our 
data indicate that ULD-mediated oligomerization of 
SATB1 is required for binding to target DNA. 

To quantitatively study the binding of wild-type and 
mutant SATB1 to target DNA, we used ITC. The 
binding of wild-type SATB1 to DNA fit well to a 
two-site binding model with calculated K d values of 
-0.36 and ~1.99uM, indicating that the SATB1 
tetramer is capable of binding to two DNA fragments 
simultaneously (Figure 4D). However, the binding of the 
KWN-AAA mutant to target DNA fit well to a one-site 
binding model with a K d of ~ 1.95 uM (Figure 4E). The 
EFH-AAA mutant was not amenable to analysis by ITC 
because it exists as multiple forms of oligomers (Figure 3A 
and Supplementary Figure S9). These ITC results indicate 
that the wild-type SATB1 can possibly bind to two DNA 
segments in an allosteric interaction with one comparable 
and another 5-fold increase binding affinity comparing to 
the KWN-AAA mutant binding to DNA, which is con- 
sistent with the observation in EMSA assays that the 



wild-type SATB1 binds to the target DNA stronger than 
the KWN-AAA mutant does. 



DISCUSSION 

Oligomerization of SATB 1 plays a very important role in 
DNA binding and has been implicated in gene regulation 
by SATB1. The structure of the SATB1 ULD domain 
reported here provides the molecular basis for how the 
ULD domain mediated the oligomerization state of 
SATB1. SATB1 assembles into a tetramer in vitro 
(Figure 5A), and the tetramerization of SATB1 is essential 
for recognizing specific DNA sequences (such as multiple 
AT-rich DNA fragments). Thus, SATB1 may regulate 
gene expression directly by binding to various promoters 
and upstream regions and thereby influencing promoter 
activity (Figure 5B). This local gene regulation model is 
consistent with experimental observations that SATB1 
directly regulates the expression of a number of genes, 
including globin, interleukin-2, interleukin-2 receptor oc 
and interleukin-5, by recruiting either coactivators or 
corepressors (5-8). Furthermore, we showed that the 
SATB1 tetramer can simultaneously bind to two DNA 
segments, and thus the tetramerization of SATB1 may 
organize high-order chromatin architecture by anchoring 
specialized DNA sequences in close proximity and recruit- 
ing various chromatin remodeling factors to coordinately 
regulate gene expression over long distances (Figure 5C). 
This long-range gene regulation model is also consist- 
ent with the observations that SATB 1 regulates the coordi- 
nated expression of genes located both at the 200-kb 
T-helper 2 cytokine locus (10) and at the 300-kb major 
histocompatibility Class I locus (11), and that it repro- 
grams chromatin organization and the transcriptional 
profiles of breast tumors to promote growth and me- 
tastasis (12,31,32). 

Long-range gene regulation plays a role in many bio- 
logical activities such as regulation of the (3-globin locus 
(33), cytokine gene cluster (34), estrogen-induced gene ex- 
pression (35) and mating- type switching in yeast (36). 
Moreover, it has been found that the CI protein of bac- 
teriophage X regulates gene expression over a long 
distance via cooperative binding of its oligomers to 
specific target DNA (37). Our proposed model for 
SATB1 oligomerization-mediated long-range gene regula- 
tion is consistent with their finding and likely represents a 
general mechanism for spatiotemporal and quantitative 
regulation of gene expression. 

Any assembly of a functional protein complex in a 
living cell must be dynamically regulated. The oligomeric 
assembly of SATB1 is not an exception. The mechanism 
that regulates the dynamic assembly of SATB1 tetramers 
remains unclear. Understanding the dynamic regulatory 
assembly mechanism of SATB1 (or SATB2) is an import- 
ant area of future research. SATB1 is known to be 
acetylated by P300/CBP-associated factor at residue 
Lysl36, located just within the 136 K 137 W 138 N motif, 
which is important for SATB1 tetramerization and 
to mediate gene regulation in coordination with 
C-terminal-binding protein 1 during Wnt signaling in 
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Figure 4. DNA binding affinity of wild-type and SATB1 mutants measured by EMSA and ITC. (A) Nucleotide sequence of the interleukin-2 (IL-2) 
promoter region spanning base pairs —447 to —441 from the translation site. (B and C) EMSA assay. The binding of SATB1 and its mutants to a 
target DNA probe was tested by EMSA using the radiolabeled synthetic duplex oligonucleotides shown in (A). The EMSA was carried out with 
various protein (B, top) or probe concentrations (C, top). Bar graph of DNA binding affinity from the dose-dependent EMSA of protein (B, bottom) 
or DNA probe (C, bottom). The radioactive intensities of protein-bound DNA bands and free DNA probe reduction due to protein binding were 
calculated with Adobe Photoshop CS4 and normalized to that of the free DNA probe. The error bars indicate the standard error mean (n = 3 
separate experiments). *P<0.05; **P-value < 0.01; ***P< 0.005. The dissociation constants (X d 's) of SATB1 (D) and the KWN-AAA mutant 
(E) with IL-2 37-bp DNA shown in (A) were measured by ITC. The ^ d 's were calculated by a two-site binding model for wild-type SATB1 to 
give K d l of ~0.36fiM and K d 2 of ~1.99fiM. For the KWN-AAA mutant, the data were fit to a one-site binding model to give a K d of ~1.95fiM. 
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Figure 5. Model of SATB1 -mediated transcriptional regulation. (A) 
Schematic showing SATB1 assembles into a tetramer by oligomeriza- 
tion of its N-terminal ULD domain. (B and C) Schematic representa- 
tion of a possible model for SATB1 oligomer-mediated transcriptional 
regulation. The SATB1 dimer or tetramer may regulate gene expression 
by recognizing specific DNA sequences in the promoter regions of 
various genes (B), and the SATB1 tetramer may organize higher-order 
chromatin architecture to coordinately regulate gene expression over 
long distances by anchoring specialized DNA sequences in close prox- 
imity and recruiting various chromatin remodeling factors (bottom). 

T cells (7). The identification of key residues (i.e. 

97 E 98 p 162 H 136 K 137 W 138 N motifs ) in the assem bly 0 f 

SATB1 oligomers (and likely SATB2) should be helpful in 
designing mutants of SATB family proteins to evaluate 
their functional roles in living cells and/or animal models. 

The results of EMSA and ITC indicate that the 
ULD-mediated SATB1 oligomerization can affect the 
DNA-binding affinity and stoichiometry for SATB1 
(Figure 4). It is possible that, similar to DNA binding, 
the ULD-mediated SATB1 oligomerization may also 



affect the binding affinity and stoichiometry for its 
various protein-binding partners. Furthermore, the ITC 
data showed that SATB1 may allosterically bind to two 
DNA fragments, indicating that SATB1 binds to multiple 
AT-rich-containing DNA fragments with higher affinity 
by its tetramerization. In addition, a previous study sug- 
gested a rapid association and dissociation kinetics of 
DNA binding by SATB1 (3). Whether the oligomerization 
of SATB1 influencing the kinetic constants of 
DNA-binding by SATB1 needs to be investigated in the 
future study by other technique such as surface plasmon 
resonance. 

Although the N- terminus of SATB1 (residues 90-204) 
was previously identified as a PDZ domain (15), structural 
studies have shown that this region is made up of ULD 
and CUTL domains. Detailed sequence alignments show 
that CUTL has the evolutionarily conserved amino acids 
involved in CUT1 DNA binding (Supplementary Figures 
S2B and S3). It would be interesting to investigate the 
functional role of the CUTL domain of SATB1 in future 
studies. 
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